Dynamics of force-induced DNA slippage 
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We study the basepairing dynamics of DNA with repetitive sequences where local strand slippage 
can create, annihilate, and move bulge loops. Using an explicit theoretical model, we find a rich 
dynamical behavior as a function of an applied shear force /: reptation-like dynamics at / = / c 
with a rupture time r scaling as iV 3 with its length N, drift-diffusion dynamics for f c < f < /*, 
and a dynamical transition to an unraveling mode of strand separation at / = /*. We predict a 
viscoelastic behavior for periodic DNA with time and force scales that can be programmed into its 
sequence. 



The dynamics of basepairing in DNA and RNA 
molecules plays an important role in biological processes 
such as DNA replication, transcription and RNA folding 

. These dynamics can be probed in detail with modern 
single molecule techniques to exert and measure piconew- 
ton forces with nanometer spatial resolution . For in- 
stance, double-stranded DNA (dsDNA) can be forced to 
open either by pulling on the two strands from the same 
end of the dsDNA ('unzipping') 0,0,0] or fr° m opposite 
ends ('shearing') [(j. In the case of unzipping, the dynam- 
ics involves the consecutive opening of native basepairs, 

1. e. those present in the ground state of the molecule, and 
is well understood theoretically Q. Here, we consider in- 
stead the shearing of dsDNA and focus specifically on 
periodic DNA sequences. This case is particularly inter- 
esting both from a physical and a biological point of view, 
since (i) periodic sequences have many non-native base- 
pairing conformations where one strand is shifted with 
respect to the other, (ii) shearing probes the transitions 
between such states, i.e. the dynamics of DNA slippage, 
see Fig.^ and (hi) DNA slippage during genome replica- 
tion allows the expansion of nucleotide repeats, and, for 
certain repeats inside genes, triggers a variety of diseases 
including Huntington's disease [8(. 

The mechanism for DNA slippage has already been 
suggested by Porschke Q, see Fig.^,: small bulge loops 
can form at one end of the molecule when a few bases 
spontaneously unbind and rebind shifted by one or sev- 
eral repeat units. Once formed, a bulge loop may diffuse 
along the molecule and anneal at the other end, effec- 
tively sliding the two strands against each other by a 
length equal to the size of the bulge loop. This mecha- 
nism involves only small energetic barriers compared to 
the large barrier for complete unbinding and reassocia- 
tion. Here, we present a detailed theoretical study of 
force-induced DNA slippage, which has so far not been 
studied experimentally. We show that this system dis- 
plays a rich dynamical behavior which can be controlled 
experimentally by adjusting the force, sequence length, 
and sequence composition. 

Model. — We consider a dsDNA of two perfectly com- 
plementary periodic sequences with N repeat units, each 
consisting of m nucleotides (for simplicity, we refer to re- 



peat units also as 'bases'). Assuming that basepairing 
within a strand is negligible, a basepairing configuration 
is specified by the set of the n < N inter-strand base- 
pairs S = {(ui,li)} with I < ui < U2 < . . . < u n < N 
for the 'upper' strand and analogously for the U in the 
'lower' strand. We assign a binding energy — < to 
each basepair and a loop cost Et(J) > when there are 
j > unpaired bases (total on both strands) between 
two consecutive basepairs. With a constant shear force 
/, see Fig. the energy of a configuration S is 



E[S] = -e b n[S] 



n[S] 



fL[S] , (1) 



i=2 



where Aui — Ui — Ui-\ and AZ; = ?j — The 
loop cost Ee(j) increases with the loop length, start- 
ing from Ei(0) = 0. Free DNA (/ = 0) is described 
by Ei(j) = si + 3 v fcsTln(j), with a loop initiation cost 
£(, > and a logarithmic asymptotic behavior derived 
from polymer theory (y w 0.6 is the Flory exponent) |l0| . 
An applied force can affect Ei(j), however our qualita- 
tive results are insensitive to its precise form Un- 
less stated otherwise, we keep only the constant term, 
Et(j>0) = £e, for simplicity. The total extension L is 



L[S] =£ a (u 1 -l + N-l n )+£, 



d ^min(AMi, A/i) , (2) 

i=2 
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FIG. 1: Sketch of periodic dsDNA, where each bead repre- 
sents one repeat unit consisting of one or several bases, (a) 
Many microscopic slippage events can lead to macroscopic 
sliding, (b) An applied shear force. 




where l& and £ s > id are the effective lengths (in the 
direction of the force) per single and double stranded 
unit, respectively. The entropic elasticity of DNA 
causes both £ d and l a to depend on the applied force, 
however since the DNA is almost fully stretched at the 
forces of interest here, we use the constant values £&/m — 
3.4 A and £ s /m = 7 A for simplicity [i"^ . 

We study the dynamics of our model both with an- 
alytical methods (described below) and a Monte Carlo 
approach using three elementary moves jl4| : opening, 
closing, and slippage of a single basepair, i.e. a pair 
(ui,k) is removed from the set S or added to it, or, if 
the basepair is adjacent to a loop, either itj or ij can be 
changed to another base inside the loop. The absolute 
timescale of these dynamics is hard to predict, but com- 
parison with bulk reannealing experiments |9( suggests 
that our simulation time step is on the order of /is in real 
time. 

Scaling of mean rupture times. — With a constant ap- 
plied force />0, eventually every finite dsDNA will rup- 
ture, since complete separation of the strands (L — > oo) 
is the state of minimal free energy. However, both the 
timescale and the nature of the rupture dynamics depend 
drastically on the force. Fig. displays the scaling of the 
mean rupture time (r) with the number of bases TV for a 
number of different forces (see caption for parameters). 
We observe four distinct asymptotic behaviors: an expo- 
nential increase with A for small forces, a cubic scaling 
with A at a certain force f c , a nearly quadratic scaling 
above f c but below a second threshold /*, and linear 
scaling above /*. The behavior in the two extremes is 
easily interpreted: for small /, rupture is driven by ther- 
mal fluctuations across a large free energy barrier with an 
associated Kramers time that scales exponentially with 
N, and linear scaling at large / is expected when individ- 
ual bonds break sequentially at a constant rate. We now 
characterize the rich behavior in the intermediate force 
regime, including the nature of the two transitions. 

The thermodynamic energy barrier disappears at a 
force f c which can be estimated by balancing the bind- 
ing energy per basepair with the mechanical work exerted 
when sliding both strands against each other by one step, 

f c « e b /(24 - i d ) . (3) 

f c is a critical force in the thermodynamic sense, if the 
state of complete rupture is excluded (see below for 
the exact calculation including all basepairing configu- 
rations). At f — f Cl the rupture dynamics is best under- 
stood by analogy with the reptation problem (lif. since 
bulge loops in the DNA structure behave similarly to the 
"stored length" excitations of a single chain in a polymer 
network: these excitations are generated at the ends of 
the polymer with constant rate independent of A, dif- 
fuse along the polymer and reach the other end with a 
probability ~ A~F Therefore, the macroscopic diffusion 
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FIG. 2: Scaling of the mean rupture time (r) with the num- 
ber of bases N for different shear forces (with Eb = 1.11, 
£^ = 2.8, which roughly corresponds to AT-sequences at 50°C, 
see Fig. 0. The symbols represent Monte Carlo data (er- 
ror less than symbol size). The solid lines for / > f c are 
power law fits (exponent with error in least significant digit 
is given; data with N < 40 shows significant finite size devia- 
tions and is excluded). For f < f c the rupture time increases 
exponentially. The data for / = 6.4 pN > f c (connected by 
the dashed line) , demonstrates the crossover from diffusive to 
drift behavior, see main text . The data for / = 23 pN is cal- 
culated including the logarithmic loop cost, which becomes 
relevant at large forces [llj . 

constant for the relative motion of the two DNA strands 
should scale as D ~ A -1 and the time for diffusion over 
distance A is ~ A 3 . 

For f > f c strand separation is energetically a downhill 
process, which induces a drift velocity v between the two 
strands. In linear response, we expext v = ji A/ for 
small A/ — f — f c with a mobility mediated by bulge 
loop diffusion, fi = D/ksT ~ A -1 (from the Einstein 
relation), leading to (r) ~ A 2 . Why does this behavior 
not persist for large forces? The second transition in the 
scaling behavior is due to a change in the rupture mode: 
at forces larger than /* « £b/(4 — ^d) the double strand 
can open by unraveling from both ends, i.e. the energy 
cost £b of opening a basepair at the end is outweighed by 
the gain f(£ s — £d) from a longer base-to-base distance in 
the single strand. In this unraveling mode, the rupture 
time scales linearly with N. The dynamical transition 
from sliding to unraveling is clearly reflected in the length 
at rupture, L[S(t)], see Fig. which is roughly a factor 
of two larger for sliding. 

Rupture time distributions. — Single molecule setups 
are ideally suited to record the full distribution of rup- 
ture times, P(t), which is a sensitive characteristic of the 
dynamics and permits a close examination of the physi- 
cal picture introduced above. The histograms in Fig. 0] 
show P(t) from simulations at f — f c and a larger force 
f c < f < f* , see caption for parameters. We observe 
that fluctuations play a dominant role at / ' = f c , i.e. the 
width of P(t) is comparable to the mean, while the rup- 
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FIG. 3: (a) Rupture length as a function of applied force / 

(parameters as in Fig. (b) Drift coeff. «o(/) extracted 

from simulations with N = 150 (circles) and analytical curve 

(solid line, fco = 1-87), see main text. 
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FIG. 4: Histogram of rupture times for two different forces, 
(a) 34.4 pN and (b) / c =16.6 pN, but the same set of DNA 
parameters, N — 80, £b = 3.75, et = 2.6, which roughly corre- 
spond to a CG-sequence at room temperature, see Fig. 



ture dynamics is drift dominated at the larger force, with 
a localized peak in P(r). 

To formulate the drift-diffusion dynamics quantita- 
tively, we treat the number of bases in the double- 
stranded region as a continuum variable x with < 
x < N, and consider the probability distribution V(x,t), 
which satisfies the continuity eq. dtP(x,i) = —d x j(x,t) 
with a force-dependent current 

j(x, t) = -D(f, x) d x V(x, t) - v(f, x) V(x, t) . (4) 

The above discussion suggests a diffusion coefficient 
of the form D(f.x) = Do(f)/x and similarly a drift 
v(f,x) = vo(f)/x. We have an absorbing boundary at 
x = and it is natural to choose a reflecting boundary 
at x = N and a delta peak at x — N as initial con- 
dition. The solution V(x,t), which must in general be 
obtained numerically, determines the rupture time dis- 
tribution through P(t) = j(0,r). 

We can determine the force-dependence of the diffu- 
sion coefficient and drift empirically by fitting the calcu- 
lated P(t) to the simulation data using Dq and vo as ad- 
justable parameters. The solid lines in Fig. 21 show that 
the drift-diffusion theory describes the simulation data 
well. Fig. 03 shows the fitted vo as a function of / (cir- 
cles). The drift vanishes at the critical force, t>o(/ c ) = 0, 
confirming the physical picture. The drift-diffusion the- 
ory also explains the crossover behavior in the vicinity of 
/ = /cj se e Fig. [5] the drift is significant only when the 
system size N is larger than the diffusive length Do/vq 
|l6| . Hence, with ~ A/, reptation-like dynamics is 
expected in a force interval Sf ~ A^ 1 around f c . 

Microscopic dynamics. — Next, we study how the 
macroscopic drift in eq. (J3J emerges from the microscopic 
bulge loop dynamics and determine Uo(/) m terms of our 
system parameters. Since bulge loops on opposite strands 
annihilate each other when they meet, the bulge loop dy- 
namics is equivalent to a reaction-diffusion system of par- 
ticles and antiparticles in one dimension. Both particles 
and antiparticles are created at each end, however with 
different rates determined by the applied force. From 
the underlying master equation for these processes one 



obtains the mean-field equations 

d t u(y,t) = k dyU(y,t) -k-Lu(y,t)l(y,t) + k 2 , 
d t l(y,t) = k d*l(y,t)-k 1 u(y,t)l(y,t) + k 2 , (5) 

where u(y, t) and l(y, t) denote the bulge loop density on 
the upper/lower strand, y £ [0, a;] is the position within 
the double stranded region, and ko, hi, k 2 are the rates 
for hopping, annihilation, and pair creation, respectively. 
At the boundaries, the densities take on constant values, 
u(0,t) = l(x,t) = p< and u(x,t) = 1(0, t) — p > , where 
p<(f) and p>(f) are calculated below by assuming a lo- 
cal equilibrium of the DNA at the edges. The macro- 
scopic drift is determined by the stationary solution and 
depends only on the difference between the loop densities 
on the upper/lower strand, v(f,x) = k^d y [u(y) — l(y)}. 
Using eq. © this yields v (f) = 2k [ P> (f) - p<(f)]. 
Fig. |2t> shows that this result is in excellent agreement 
with the empirical vo(f) obtained above. 

Since the loop cost Ei(j) is larger for two separate 
loops than for a single one of the combined length, bulge 
loops on the same strand feel a short-range attraction. 
However, the interaction is not strong enough to cause a 
significant aggregation of the loops in our Monte Carlo 
simulations. This is consistent with the observation that 
with our DNA parameters, the interaction energy S£ is 
never significantly larger than the entropic cost ~ log/9 
of colocalization at loop density p. While i>o(f) is appar- 
ently robust to interaction effects, the diffusion coefficient 
Do(f) is sensitive to interactions as well as correlations. 
Both are neglected in eq. (0, leaving the microscopic 
calculation of Dq(J) as a challenge for the future. 

Critical force. — To obtain the exact critical force, we 
need the partition function Z = ^ e~ E ^ s ^ kBT summed 
over all configurations S with at least one basepair. ft 
is useful to allow for different numbers of bases in the 
two strands, e.g. I < u.- L < N and 1 < < M, with a 
corresponding partition function 

N-1M-1 N-iM-j 

z(n, M) = J2Y1 E E z >> ™) > ( fi ) 

i— j—0 n— 1 m—1 

where b s — e^ l "/ kBT is the Boltzmann factor for a 
stretched base, and Z p (n, m) is the partition function for 
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FIG. 5: (a) Model parameters for different DNA sequences 
and temperatures as obtained by fitting to a detailed thermo- 
dynamic model [Til Ht| (all energies in units of kT). (b) The 
exact critical force compared to the estimate of eq. @. 



the central, double stranded section starting with the first 
and ending with the last basepair, cf. Fig. \iy>. We cal- 
culate Z p (n, m) recursively by introducing a complemen- 
tary partition function Z u (n,m) containing only struc- 
tures where the last of the n upper bases is not bound to 
the last of the m lower bases: 

Z p (n+l,m+l) = qb d Z p (n,m)+qb d gZ u (n,m) , (7) 

n m 

Z u (n+l,m + l) = g /J Z p (k, m + 1) + g >J Z p (n+1, fc) 

k=l k=l 

+ gb d Z p (n,m) + b d Z u (n,m) . 



Here, the Boltzmann factors q = e t 



and b d 



a f£ d /k B T 



fJ 



-ee/2k B T 



account for basepairing, loop costs, 



and stretching of double strand, respectively. These re- 
cursion relations allow the efficient calculation of the par- 
tition function for finite N, M. To obtain the critical be- 
havior for N — > oo, we take the z-transform Z(z, y) — 
Y^n M Z(N, M)z N y M . The inverse z-transform is then 
determined by the simultaneous poles of Z{z, y) in z and 
y, and for large N the pair of poles with the smallest \zy\ 
dominates. A detailed analysis of the critical behavior 
will be presented elsewhere |llj : here we are interested 
in f c , i.e. the force where the dominant pole switches. 
We find that f c is the nontrivial root of 



-l\-9 z q 



b s -l b d 



1 =0 



(8) 



When £b or eg 3> fc^T, the second term is negligible and 
the nontrivial root of © is b^/b d = q, recovering the 
naive estimate 0. However, for smaller £b, £e one finds 
significant deviations from ©, see Fig. EI 

Loop densities. — Using the same approach as for 
the calculation of f c , we can calculate the loop den- 
sities p<, ,o> introduced above. Assuming local equi- 
libration at the edges, i.e. equilibration between all 
possible conformations of the two strands with a fixed 
central basepair, we find p< = b P (a, b) a/A and 
p> = ^2 a f,P(o,,b)b/X, where A = min(a,6) + l and 

P{a,b) = b s b - a b d x qgZ p (N-b-l,N-b-l)/ZJN,N). 
The sums can be evaluated exactly for large 



Conclusions. — We find a response of periodic dsDNA 
to shear forces that is very distinct from that for nonperi- 
odic sequences. Above a thermodynamic critical force / c , 
but below a dynamic critical force /*, bulge loop diffusion 
allows periodic DNA to open by sliding. This mechanism 
leads to a much lower thermodynamic critical force than 
the unraveling mechanism by which nonpcriodic DNA 
opens. Within our model, we have calculated f c exactly 
and characterized the associated dynamics, which is ef- 
fectively viscoelastic with a creep compliance ~ A^ _1 for 
fc < f < /*■ Above /* periodic dsDNA also opens 
predominantly by unraveling (this dynamical transition 
may be regarded as a remnant of the thermodynamic 
transition for nonperiodic sequences). Interestingly, pe- 
riodic DNA could be used as a viscoelastic nanomechan- 
ical element with properties that are programmable by 
choosing sequence length and composition. This may 
lead to applications in microstructured devices, similar 
to the prog rammable DNA-based force sensors reported 
in Ref. . 
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